Abstract


A two-sample test is studied which rejects the null hypothesis of equal
population medians when two Wilcoxon distribution free confidence intervals
are disjoint. A confidence interval for the difference in population medians
is constructed by subtracting the endpoints of two one-sample confidence
intervals. Two different ways to select the one-sample intervals are
presented. A solution that specifies equal confidence coefficients for the
one-sample intervals is recommended. All solutions are shown to have the same
asymptotic (Pitman) efficiency as the Mann-Whitney two-sample test.


Key Words and Phrases: Mann-Whitney-Wilcoxon test, nonparametric test, sign
test, nonparametric confidence intervals, notched box plots.
1. Introduction

The purpose of this paper is to discuss a two-sample test and confidence
interval based on the comparison of one-sample confidence intervals. This
approach is appealing because, along with the two-sample inference, a
statistical description of the individual samples is provided and a graphical
representation of the two intervals provides an effective presentation of the
inference (see Figure 1). The test rejects the null hypothesis of equal
population medians if the one-sample intervals are disjoint. The confidence
interval for the difference in population medians is constructed by
subtracting the endpoints of one interval from the opposite endpoints of the
other.

McGill, Tukey and Larsen (1978) discuss a similar idea when presenting
notched box plots as a way of displaying relevant information about a
population. The notch in their box is a confidence interval of the form
$M \pm 1.7\,SE$ for the population median, where $M$ is the sample median and
$SE$ is a sample estimate of the asymptotic standard error of the sample median
when sampling from a normal population. The factor 1.7 was empirically chosen
to produce a two-sample test with a two-sided level of roughly 5% for several
distributions. It is difficult to interpret the individual intervals
(notches) and the "level" of the test since they are based on such rough
approximations. McKean and Schrader (1983), in a simulation study, found the
confidence coefficient for this procedure to be highly unstable for small to
moderate sample sizes, even at the normal distribution. Hettmansperger
(1984) suggested notching the box plots with sign-intervals. These
intervals have order statistics from the sample as endpoints and are obtained
by inverting the acceptance region of a two-sided sign test. These intervals
are distribution free, and have a well-defined asymptotic distribution theory
that does not require normality or symmetry assumptions. He also showed that
this procedure has the same asymptotic efficiency as Mood's (1950) median
test, which corresponds to a particular choice of sign-intervals. Tableman
(1984) developed the small sample distribution theory for the comparison of
sign-intervals so that one can easily obtain the exact level and confidence
coefficient of the inference.

For the case of symmetric populations, we propose notching the box plots
with one-sample Wilcoxon-intervals, which are defined in Section 2. It is
shown that these intervals are distribution free, and exact confidence
coefficients can be obtained from a table of the Wilcoxon (1945) signed-rank
distribution. Several large sample results concerning the endpoints are
presented. In Section 3, the two-sample inference is developed and
discussed. It is shown that the suggested procedures produce two-sample
tests that have the same Pitman efficiency as the Mann-Whitney-Wilcoxon
(1945, 1947) two-sample test, when sampling from symmetric distributions.
Hence, the proposed procedure can be much more efficient than the procedure
based on sign-intervals. Two choices of confidence coefficients for the two
intervals are discussed and an example is presented. In the final section,
we discuss the effects of asymmetry on the two-sample inference. Although in
simulation studies we have not found the size of the test to be affected, a
reduction in power (Pitman efficiency) of the proposed test can result.
Hence, with strong asymmetry, it may no longer be a competitor to the
Mann-Whitney-Wilcoxon test. However, in many models it may not be
unreasonable to assume symmetry or approximate symmetry, especially after a
transformation has been applied.


2. The Wilcoxon-Interval

Assumptions 2.1:

(i) Suppose $X_1, \ldots, X_m$ is a random sample of size $m$ with cumulative
distribution function (cdf) $F(x-\theta)$, where $\theta$ is the unique median,
$F$ is continuous and has density $f$, symmetric about 0, which
satisfies $\int f^2(x)\,dx < \infty$.

(ii) Let $G_\theta(t)$ denote the cdf of $(X_1+X_2)/2$. Let $\xi_p$ denote the
$p$th quantile of $F(x-\theta)$, $0<p<1$. Suppose $G_\theta$ is twice
differentiable in a neighborhood of $\xi_p$, with $G_\theta' = g_\theta$
positive and $G_\theta''$ bounded in the neighborhood.

Remark 2.1. If $f$ is symmetric, absolutely continuous, and has finite
Fisher's information $\int_{-\infty}^{\infty} (f'(x)/f(x))^2 f(x)\,dx < \infty$,
then the Assumptions 2.1 concerning $F$ and $G_\theta$ are satisfied. For a
proof, see Aubuchon (1982, p.19).

The Wilcoxon confidence interval for $\theta$, the true population median, is
derived by inverting the acceptance region of a size
$\alpha = 2P_\theta(T_m(\theta) < d(m)) = 2P_0(T_m(0) < d(m))$ Wilcoxon signed
rank test, where

$$T_m(\theta) = \sum_{i=1}^{m} R(|X_i-\theta|)\, I\{X_i > \theta\} = \sum_{1 \le i \le j \le m} I\{(X_i+X_j)/2 > \theta\} \qquad (2.1)$$

with $I\{A\}$ denoting the indicator function of the event $A$ and
$R(|X_i-\theta|)$ representing the rank of $|X_i-\theta|$ among the ordered
$|X_j-\theta|$, $j=1,\ldots,m$. The $m(m+1)/2$ averages $(X_i+X_j)/2$,
$1 \le i \le j \le m$, are referred to as the Walsh averages.
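The second equality in (2.1) can be checked numerically. The following is a minimal sketch, assuming a sample with no ties among the absolute deviations; the function names and the sample values are illustrative and do not come from the paper.

```python
from itertools import combinations_with_replacement


def signed_rank_statistic(xs, theta=0.0):
    """Sum of the ranks of |x - theta| over observations with x > theta.

    Assumes no ties among the absolute deviations.
    """
    abs_dev = sorted(abs(x - theta) for x in xs)
    rank = {d: r for r, d in enumerate(abs_dev, start=1)}
    return sum(rank[abs(x - theta)] for x in xs if x > theta)


def walsh_count(xs, theta=0.0):
    """Number of Walsh averages (x_i + x_j)/2, i <= j, that exceed theta."""
    return sum((a + b) / 2 > theta
               for a, b in combinations_with_replacement(xs, 2))


# Illustrative values only (hypothetical, not data from the paper).
sample = [1.8, -0.6, 2.9, -3.1, 0.4, 5.2]
t_rank = signed_rank_statistic(sample)
t_walsh = walsh_count(sample)
```

Both functions return the same count for this sample, which is the identity the interval construction relies on.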


Let $M = m(m+1)/2$ and let

$$Z_{(1)} \le Z_{(2)} \le \cdots \le Z_{(M)} \qquad (2.2)$$

represent the ordered Walsh averages. In view of the second equality in
(2.1),

$$d(m) \le T_m(\theta) \le m(m+1)/2 - d(m) \quad \text{if and only if} \quad [Z_{(d(m))}, Z_{(M-d(m)+1)}) \text{ contains } \theta. \qquad (2.3)$$

Since we assume $F$ is continuous, we take the Wilcoxon-interval for $\theta$
to be the closed interval

$$[Z_{(d(m))}, Z_{(M-d(m)+1)}]. \qquad (2.4)$$

The $d(m)$-value will be called a depth since it specifies how deep into the
ordered Walsh averages the endpoints lie. It is clear that this interval has
exact confidence coefficient $\gamma = 1-\alpha$ and is distribution free (since
the Wilcoxon signed rank test statistic is).
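The construction in (2.4) and its exact confidence coefficient can be sketched as follows. This is an illustration under the stated assumptions (continuous, symmetric population, so that the null distribution of the signed rank statistic is that of a sum of a random subset of the ranks 1, ..., m); the function names are hypothetical.

```python
from itertools import combinations_with_replacement


def wilcoxon_interval(xs, depth):
    """Closed interval [Z_(d), Z_(M-d+1)] of ordered Walsh averages.

    The depth d is counted from 1, as in (2.4).
    """
    walsh = sorted((a + b) / 2
                   for a, b in combinations_with_replacement(xs, 2))
    big_m = len(walsh)  # M = m(m+1)/2
    if not 1 <= depth <= big_m - depth + 1:
        raise ValueError("depth is too large for this sample size")
    return walsh[depth - 1], walsh[big_m - depth]


def signed_rank_null_counts(m):
    """counts[t] = number of subsets of {1, ..., m} whose ranks sum to t.

    Dividing by 2**m gives the null distribution of the signed rank statistic.
    """
    counts = [1] + [0] * (m * (m + 1) // 2)
    for r in range(1, m + 1):
        for t in range(len(counts) - 1, r - 1, -1):
            counts[t] += counts[t - r]
    return counts


def exact_confidence(m, depth):
    """Exact coefficient 1 - 2 P_0(T_m < d(m))."""
    counts = signed_rank_null_counts(m)
    return 1 - 2 * sum(counts[:depth]) / 2 ** m
```

For example, `exact_confidence(10, 9)` is roughly 0.951, the familiar 95% signed-rank interval for a sample of size 10.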

Define

$$G_m(x) = \sum_{1 \le i < j \le m} I\{(X_i+X_j)/2 \le x\} \Big/ \binom{m}{2} \qquad (2.5)$$

and note that the empirical cdf of the Walsh averages satisfies

$$\hat{G}_m(x) = 1 - 2T_m(x)/m(m+1) = G_m(x) + O(m^{-1}). \qquad (2.6)$$


Now a slight adaptation of Lemma 4.2 of Geertsema (1970) yields the following
lemma:

Lemma 2.1. Under Assumptions 2.1 (with $\xi_p = \theta$), if $\{k_m\}$ is any
sequence of integers satisfying $2k_m/m(m+1) = 1/2 + O(m^{-1/2}\log m)$ as
$m \to \infty$, then wp1

$$Z_{(k_m)} = \theta + [2k_m/m(m+1) - \hat{G}_m(\theta)]/G_\theta'(\theta) + O(m^{-3/4}\log m) \qquad (2.7)$$

where $\hat{G}_m$ is defined in (2.6) and $Z_{(k_m)}$ is defined in (2.2).


We have at once the following almost sure representation of the endpoints of
the Wilcoxon-interval. The constant $k_\alpha$ is suggested by the normal
approximation used with the Wilcoxon statistic.


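Theorem 2.1. Under Assumptions 2.1, wp1

$$Z_{(d(m))} = \theta - \frac{k_\alpha}{(12m)^{1/2}\int f^2(x)\,dx} - \frac{\hat{G}_m(\theta) - 1/2}{2\int f^2(x)\,dx} + o(m^{-1/2})$$

$$Z_{(M-d(m)+1)} = \theta + \frac{k_\alpha}{(12m)^{1/2}\int f^2(x)\,dx} - \frac{\hat{G}_m(\theta) - 1/2}{2\int f^2(x)\,dx} + o(m^{-1/2}) \qquad (2.8)$$

where $d(m) = m(m+1)/4 + .5 - k_\alpha\,(m(m+1)(2m+1)/24)^{1/2}$.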


Proof: We prove the theorem for the lower endpoint $Z_{(d(m))}$. A similar
argument holds for the upper endpoint.

It is easily shown that
$d(m)/(m(m+1)/2) = 1/2 - k_\alpha/(3m)^{1/2} + o(m^{-1/2}) = 1/2 + O(m^{-1/2}\log m)$.
Now, $G_\theta'(\theta) = 2\int f^2(x)\,dx$, and observe that
$O(m^{-3/4}\log m) = o(m^{-1/2})$. The theorem follows immediately from the
preceding lemma.

Remark 2.2. Under Assumption 2.1(ii), Geertsema's result and hence Theorem
2.1 can be easily extended to include any quantile $\xi_p$, $0<p<1$.

Let $L = Z_{(d(m))}$ and $U = Z_{(M-d(m)+1)}$, the lower and upper endpoints of
the Wilcoxon-interval (2.4). As a corollary to Theorem 2.1, we have the
following:

Corollary. As $m \to \infty$,

(i) $m^{1/2}(L-\theta) \xrightarrow{D} Z \sim \eta\left(-k_\alpha/(12^{1/2}\int f^2(x)\,dx),\; 1/(12(\int f^2(x)\,dx)^2)\right)$,
where $\eta(\mu,\sigma^2)$ denotes the normal cdf with parameters $\mu$ and
$\sigma^2$.

(ii) $m^{1/2}(U-L) \to k_\alpha/(3^{1/2}\int f^2(x)\,dx)$ wp1. (2.9)

(iii) $P(\theta < L) \to \Phi(-k_\alpha) = \alpha$, where $\Phi(\cdot)$ is the
standard normal cdf.

Proof: To prove part (i), we use the fact that $G_m(\cdot)$ in (2.5) is a
U-statistic. By a theorem due to Hoeffding (Serfling, 1980, p.192),
$m^{1/2}(G_m(\theta)-1/2) \xrightarrow{D} \eta(0,1/3)$. Slutsky's theorem along
with (2.6) implies $m^{1/2}(\hat{G}_m(\theta)-1/2) \xrightarrow{D} \eta(0,1/3)$.
Now, (i) follows at once from (2.8) and Slutsky's theorem. Part
(ii) is immediate from (2.8). In part (iii), write $P(\theta<L)$ as
$P(0 < m^{1/2}(L-\theta))$ and apply part (i).

Hence, if the Wilcoxon-interval (2.4) is defined by depths

$$d(m) = m(m+1)/4 + .5 - k_\alpha\,(m(m+1)(2m+1)/24)^{1/2} \qquad (2.10)$$

the approximate confidence coefficient is $\gamma = 1-2\Phi(-k_\alpha) = 1-2\alpha$.
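As a rough illustration of (2.10), the depth and the approximate confidence coefficient can be computed as below. The formula returns a real number; rounding it to the nearest integer is an assumption made here for the sketch, since the text does not state a rounding rule. The function names are hypothetical.

```python
import math
from statistics import NormalDist


def raw_depth(m, k):
    """Real-valued depth from (2.10) for sample size m and constant k."""
    return m * (m + 1) / 4 + 0.5 - k * math.sqrt(m * (m + 1) * (2 * m + 1) / 24)


def rounded_depth(m, k):
    """Integer depth; nearest-integer rounding is an assumption of this sketch."""
    return max(1, round(raw_depth(m, k)))


def approx_confidence(k):
    """Approximate coefficient 1 - 2 * Phi(-k)."""
    return 1 - 2 * NormalDist().cdf(-k)
```

With `m = 10` and `k = 1.96` the raw depth is about 8.77, so the rounded depth is 9 and the approximate coefficient is about 0.95.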




3. The Two-Sample Inference

Let $X_1, \ldots, X_m$ and $Y_1, \ldots, Y_n$ represent independent random
samples from $F(x-\theta_x)$ and $F(y-\theta_y)$ respectively, where

(i) $\theta_x$ and $\theta_y$ are the unique medians of the X and Y populations,

(ii) $F$ satisfies the Assumption 2.1,

(iii) $G_{\theta_x}(t_x) = P((X_1+X_2)/2 \le t_x)$ and
$G_{\theta_y}(t_y) = P((Y_1+Y_2)/2 \le t_y)$ satisfy the Assumption 2.1.

Further, with $M = m(m+1)/2$ and $N = n(n+1)/2$, let

$$[L_x, U_x] = [Z_{(d(m))}, Z_{(M-d(m)+1)}]$$

and

$$[L_y, U_y] = [Z_{(d(n))}, Z_{(N-d(n)+1)}] \qquad (3.1)$$

represent Wilcoxon-intervals (2.4) constructed on the X and Y samples,
respectively, where

(i) the depths $d(m)$ and $d(n)$ are defined as in Theorem 2.1, however,
with $k_x$ and $k_y$ replacing the $k_\alpha$, and

(ii) $\gamma_x = 1-2P(T_m(0)<d(m))$ and $\gamma_y = 1-2P(T_n(0)<d(n))$ denote the
respective confidence coefficients.


To test

$$H_0: \Delta = \theta_y - \theta_x = 0 \quad \text{vs.} \quad H_A: \Delta \ne 0$$

we reject $H_0$ if the two Wilcoxon-intervals are disjoint. That is,

$$\text{if } U_x < L_y \text{ or } U_y < L_x. \qquad (3.2)$$

We wish to pick the two Wilcoxon-intervals so that if the intervals are
disjoint we reject $H_0: \Delta=0$ in favor of $H_A: \Delta \ne 0$ with a
specified significance level $\alpha_c$.

Recall from the Corollary to Theorem 2.1, part (iii), that the
approximate confidence coefficients are $\gamma_x = 1-2\Phi(-k_x)$ and
$\gamma_y = 1-2\Phi(-k_y)$, respectively. The following theorem relates
$\alpha_c$, the comparison rate, to $k_x$ and $k_y$, which determine $\gamma_x$
and $\gamma_y$. It therefore indicates how to pick $\gamma_x$ and $\gamma_y$ so
that we achieve $\alpha_c$ (or $\gamma_c = 1-\alpha_c$) at least in an
approximate sense.

Theorem 3.1. Suppose that $m,n \to \infty$ so that $m/(m+n) \to \lambda$,
$0<\lambda<1$. Then under $H_0: \Delta=0$, with the decision to reject given by
(3.2),

$$\alpha_c = P(U_x<L_y) + P(U_y<L_x) \to 2\Phi\left(-(1-\lambda)^{1/2}k_x - \lambda^{1/2}k_y\right), \qquad (3.3)$$

where $\Phi(\cdot)$ is the standard normal cdf.

The proof is the same as that of the corollary to Theorem 2.1 of
Hettmansperger (1984).

Define $k_c$ by $\alpha_c = 2\Phi(-k_c)$. Then from (3.3), $k_x$ and $k_y$ must
satisfy the condition

$$k_c = (1-\lambda)^{1/2}k_x + \lambda^{1/2}k_y. \qquad (3.4)$$

Clearly, there are an infinite number of choices for $k_x$ and $k_y$, hence for
$\gamma_x$ and $\gamma_y$. We will discuss two of these choices later.

The test can be based on a confidence interval for $\Delta = \theta_y-\theta_x$.
An equivalent decision is: Reject $H_0: \Delta=0$ in favor of
$H_A: \Delta \ne 0$ if

$$0 \text{ is not contained in } [L_y-U_x,\; U_y-L_x]. \qquad (3.5)$$

Hence, if $\alpha_c$ is the significance level of the test, then the interval in
(3.5) has confidence coefficient $\gamma_c = 1-\alpha_c$. We therefore take the
two-sample confidence interval for $\Delta = \theta_y-\theta_x$ to be

$$[L_y-U_x,\; U_y-L_x] = [Z_{(d(n))} - Z_{(M-d(m)+1)},\; Z_{(N-d(n)+1)} - Z_{(d(m))}]. \qquad (3.6)$$

This interval has approximate confidence coefficient
$\gamma_c = 1-2\Phi(-k_c)$ where $k_c$ is defined in (3.4).
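The decision rule (3.2) and the interval (3.5) can be written in a few lines. This is only a sketch that takes two already computed one-sample intervals as input; the function name and the endpoints in the usage lines are hypothetical.

```python
def two_sample_inference(x_interval, y_interval):
    """Return (reject, delta_interval) from two one-sample intervals.

    reject is True when the intervals are disjoint, as in (3.2).
    delta_interval is [L_y - U_x, U_y - L_x], as in (3.5).
    """
    lx, ux = x_interval
    ly, uy = y_interval
    reject = ux < ly or uy < lx
    delta_interval = (ly - ux, uy - lx)
    return reject, delta_interval


# Hypothetical endpoints, chosen only to show the two possible outcomes.
disjoint = two_sample_inference((1.0, 2.0), (2.5, 4.0))
overlapping = two_sample_inference((1.0, 3.0), (2.0, 4.0))
```

In both cases the rejection flag agrees with whether 0 lies outside the returned interval, which is the equivalence stated in the text.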

Remark 3.1. The present procedure is based on the ordered dependent Walsh
averages $\{(X_i+X_j)/2;\ 1 \le i \le j \le m\}$ and
$\{(Y_i+Y_j)/2;\ 1 \le i \le j \le n\}$. Hence, we are unable
to formulate the procedure in terms of statistics whose distributions are
tractable. We must rely on the normal approximation to the size $\alpha_c$. For
a second-order approximation based on Edgeworth expansions see Tableman (1984).

The following theorem, which is similar to Theorem 2.3 of Hettmansperger
(1984), gives the asymptotic length of the two-sample confidence interval
(3.6), which is a measure of its efficiency and hence of the test from which
it is derived. The asymptotic length does not depend on $k_x$ or $k_y$ but
rather on $k_c$ as defined in (3.4).

Theorem 3.2. Suppose $m,n \to \infty$ so that $m/(m+n) \to \lambda$,
$0<\lambda<1$. Let $\Lambda(TW)$ denote the length of the two-sample confidence
interval given in (3.6). Then, wp1

$$(m+n)^{1/2}\Lambda(TW) \to \frac{k_c}{(3\lambda(1-\lambda))^{1/2}\int f^2(x)\,dx}. \qquad (3.7)$$

We note that not only do all the intervals (obtained through the
defining relationship (3.4)) have the same asymptotic length, but
$(m+n)^{1/2}\Lambda(TW)/(2k_c)$ converges to
$1/((12\lambda(1-\lambda))^{1/2}\int f^2(x)\,dx)$ wp1. This limit is the
reciprocal of the Pitman efficacy of the Mann-Whitney-Wilcoxon two-sample
test; see Hettmansperger (1984, p.163). Hence, the tests based on the
Wilcoxon-intervals have the same efficiency properties as the
Mann-Whitney-Wilcoxon test.

As mentioned previously, there are an infinite number of ways to choose
the intervals so that the two-sample test has asymptotic size $\alpha$. For the
case of sign-intervals Hettmansperger (1984) considered restricting the two
intervals to having the same confidence coefficient or having the same
asymptotic length. The same solution applies to the Wilcoxon-intervals. In
particular, we recommend the equal confidence coefficients solution because,
as indicated in Table 1 of Hettmansperger (1984), the common value of the
confidence coefficients does not change much as the ratio of sample sizes
varies between 1 and 3. Other solutions are not as stable.


Hence, we recommend using two 84% Wilcoxon-intervals for a 5% two-sided,
two-sample test or two 75% Wilcoxon-intervals for a 10% two-sided, two-sample
test. The latter case provides for a 5% one-sided, two-sample test.
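The equal confidence coefficients solution can be reproduced from (3.4) by setting $k_x = k_y = k$, which gives $k = k_c/((1-\lambda)^{1/2} + \lambda^{1/2})$. The sketch below computes the resulting common coefficient from the normal approximation; the function name and the `ratio` parameter (the ratio m/n of sample sizes) are illustrative assumptions.

```python
import math
from statistics import NormalDist


def common_coefficient(alpha_c, ratio):
    """Common one-sample coefficient for a two-sided comparison rate alpha_c.

    ratio is m/n, so lambda = m/(m+n) = ratio / (1 + ratio).
    Uses (3.4) with k_x = k_y and the normal approximation 1 - 2*Phi(-k).
    """
    lam = ratio / (1 + ratio)
    k_c = NormalDist().inv_cdf(1 - alpha_c / 2)
    k = k_c / (math.sqrt(1 - lam) + math.sqrt(lam))
    return 1 - 2 * NormalDist().cdf(-k)


# Coefficients for a 5% test at sample size ratios 1, 2 and 3.
table = {r: round(common_coefficient(0.05, r), 3) for r in (1, 2, 3)}
```

For equal sample sizes this gives about 0.834 at the 5% level and about 0.755 at the 10% level, and the 5% value rises only to about 0.85 when the ratio is 3.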

Example. We illustrate the ideas on a small data set. Suppose we wish to
compare the growth of mushrooms with and without vitamin B1 in their diets.
The data given in Table 1 have been altered for illustration purposes from
those given by Lehmann (1975, p.47). The measurement is weight gain in
milligrams at the end of an observation period.

-  Table  1  about  here  - 

Let $\Delta = \theta_y-\theta_x$, the difference between the treatment (B1) and
control (NO B1) population medians. We assume the weight gain populations are
symmetric. To test $H_0: \Delta=0$ vs $H_A: \Delta \ne 0$, we could use the
Mann-Whitney-Wilcoxon rank sum test. Using the MANN command in Minitab (see
Ryan, Joiner and Ryan (1985)), we find the sum of the ranks (breaking ties by
averaging) of the treatment group is 141.5 with a two-sided p-value of .0065.
Hence we reject $H_0: \Delta=0$ at $\alpha < .01$.

A more informative analysis is based on the comparison of two 84%
Wilcoxon confidence intervals for $\theta_x$ and $\theta_y$. Figure 1 provides
the notched box plots for the two groups. Comparing the 85%-Wilcoxon notches
leads to rejection of $H_0: \Delta=0$ at the 5% level since they are disjoint.


-  Figure  1  about  here  - 


Further, we have the intervals and estimates for $\theta_y$ and $\theta_x$ as
$[L_y,U_y] = [23.0,28.5]$, $\hat\theta_y = 25.5$ and $[L_x,U_x] = [15.0,20.5]$,
$\hat\theta_x = 18.0$, respectively. The estimates $\hat\theta_x$ and
$\hat\theta_y$ are the Hodges-Lehmann (1963) estimates, computed as the
median of the Walsh averages. In addition, a 95% confidence interval for
$\Delta$ is given by $[L_y-U_x,\; U_y-L_x] = [2.5,13.5]$ with
$\hat\Delta = \hat\theta_y-\hat\theta_x = 7.5$. The figure
along with the intervals and estimates are available using the BOXPLOT and
WINT commands in Minitab.

Note that the Hodges-Lehmann estimate and 95% confidence interval for $\Delta$
derived from the Mann-Whitney statistic are $\hat\Delta = 8.0$ and [3.0,13.0],
respectively. These differ slightly from the Wilcoxon-interval comparison.

Eighty-five percent sign-notches are not available for the given sample
size. The closest confidence coefficient to 85 percent is 89 percent and
yields [22.0,29.0] and [14.0,23.0] for the B1 and NO B1 groups, respectively.
These intervals overlap and would fail to reject $H_0: \Delta=0$ at some level
less than .05. If the distributions are not too heavy tailed then the Wilcoxon
notches and the Mann-Whitney-Wilcoxon test are more efficient than the
sign-notches and this may result in a failure of the sign-notches to detect a
significant difference when the other methods will detect it.


4. The Effects of Asymmetry on the Two-Sample Inference

As remarked in Section 2, Geertsema's result can be extended to include
any quantile $\xi_p$, $0<p<1$. We state here, without proof, the form most
useful for our discussion. We note that the result does not rely on the
symmetry of $F$.

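Theorem 4.1. Under Assumptions 2.1 (i), (ii) (but with symmetry on $F$
dropped), if $\{k_m\}$ is any sequence of integers satisfying
$2k_m/m(m+1) = G_\theta(\xi_p) + k/m^{1/2} + o(m^{-1/2})$ as $m \to \infty$,
then wp1

$$Z_{(k_m)} = \xi_p + \frac{k}{m^{1/2}G_\theta'(\xi_p)} - \frac{\hat{G}_m(\xi_p) - G_\theta(\xi_p)}{G_\theta'(\xi_p)} + o(m^{-1/2}).$$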

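Let $\theta^*$ denote the median of $G_\theta(\cdot)$. Note that under
asymmetry, $\theta^* \ne \theta$. With the depth $d(m)$ defined as in Theorem
2.1, we have $2d(m)/m(m+1) = 1/2 - k_\alpha/(3m)^{1/2} + o(m^{-1/2})$ as
$m \to \infty$. But now, $1/2 = G_\theta(\theta^*)$. Hence, Theorem 4.1
gives us the following almost sure representation of the endpoints of the
one-sample Wilcoxon-interval. They differ from those given in Theorem 2.1.

$$L = \theta^* - \frac{k_\alpha}{(3m)^{1/2}G_\theta'(\theta^*)} - \frac{\hat{G}_m(\theta^*) - 1/2}{G_\theta'(\theta^*)} + o(m^{-1/2})$$

and (4.1)

$$U = \theta^* + \frac{k_\alpha}{(3m)^{1/2}G_\theta'(\theta^*)} - \frac{\hat{G}_m(\theta^*) - 1/2}{G_\theta'(\theta^*)} + o(m^{-1/2}).$$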

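Before proceeding to the two-sample inference, we make the following
remarks:

Remarks 4.1.

(i) When $F$ is not symmetric, $\theta^*$ differs from $\theta$ by a constant
$a$; that is, $\theta^*(\theta) = \theta + a$. This follows from the fact that
$\theta^*$ is such that

$$G_\theta(\theta^*) = \int_{-\infty}^{\infty} F(2\theta^* - 2\theta - x) f(x)\,dx = 1/2.$$

Now, differentiate with respect to $\theta$ to obtain
$d\theta^*(\theta)/d\theta = 1$.
Hence, $G_\theta'(\theta^*) = 2\int_{-\infty}^{\infty} f(2a-x) f(x)\,dx$.

(ii) It follows from Theorem A in Serfling (1980, p.192) and from (2.6)
that

$$m^{1/2}[\hat{G}_m(\theta^*) - 1/2]/G_\theta'(\theta^*) \xrightarrow{D} Z \sim \eta\left(0,\; (\nu - 1/4)\Big/\Big(\int_{-\infty}^{\infty} f(2a-x) f(x)\,dx\Big)^2\right)$$

as $m \to \infty$, where $\nu = \int_{-\infty}^{\infty} [F(2a-x)]^2 f(x)\,dx$.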

We now return to the two-sample inference. Let $\theta_x^*$ and $\theta_y^*$
denote the medians of $G_{\theta_x}(\cdot)$ and $G_{\theta_y}(\cdot)$,
respectively. Under the same shape assumption on the underlying distributions,
$\theta_x^* = \theta_x + a$ and $\theta_y^* = \theta_y + a$. It is clear that
testing $H_0: \theta_x = \theta_y$ is equivalent to testing
$H_0: \theta_x^* = \theta_y^*$, and that $\Delta = \theta_y^* - \theta_x^*$. We
are now in a position to approximate $\alpha_c$, the comparison rate of the test
(3.2), and to determine the asymptotic length of the two-sample interval
(3.6) for $\Delta$ when $F$ is asymmetric. These results are stated in the next
theorem and they follow immediately from (4.1) and Remarks 4.1.

Theorem 4.2. Suppose that $m,n \to \infty$ so that $m/(m+n) \to \lambda$,
$0<\lambda<1$.

(i) Then under $H_0: \Delta=0$, with the decision to reject given by (3.2),

$$\alpha_c \to 2\Phi\left(-k_c/(12(\nu-1/4))^{1/2}\right).$$

(ii) Let $\Lambda(TW)$ denote the length of the two-sample confidence interval
given in (3.6). Then, wp1,

$$(m+n)^{1/2}\Lambda(TW) \to k_c\Big/\Big[(3\lambda(1-\lambda))^{1/2}\int_{-\infty}^{\infty} f(2a-x) f(x)\,dx\Big].$$


Remarks 4.2.

(i) Suppose $k_c$ is the critical value that provides a size-$\alpha_c$
two-sided test when $F(\cdot)$ is symmetric. Although $\nu = 1/3$ when
$F(\cdot)$ is symmetric, for asymmetric distributions we have
$1/4 < \nu < 1/2$. We see from part (i) of Theorem 4.2 that this can,
theoretically, increase the size. However, Tableman (1984) simulated the
two-sample test (3.2) for samples of small to moderate sizes from
chi-square distributions with 4, 18, 30 degrees of freedom and
from the standard normal distribution. No evidence was found that
asymmetry (in this case, right skewness) affects the size.


(ii) Part (ii) of Theorem 4.2 reveals that the main effect of asymmetry
is the possibly significant reduction in power. This follows from
the fact that for certain asymmetric densities
$\int_{-\infty}^{\infty} f(2a-x) f(x)\,dx$ can be quite small. We also note
that the Pitman efficiency is no longer that of the Mann-Whitney two-sample
test. In this setting, the efficacy of that test remains
$(12(1-\lambda)\lambda)^{1/2}\int f^2(x)\,dx$, and hence that test
may be much more efficient since
$\int_{-\infty}^{\infty} f(2a-x) f(x)\,dx < \int_{-\infty}^{\infty} f^2(x)\,dx$.
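To give a feel for the size of these effects, the quantities in Theorem 4.2 can be evaluated for one skewed distribution. The sketch below assumes a shifted standard exponential population, a choice made here for illustration and not taken from the paper. For that case the sum of two observations is Gamma(2,1), so with $s$ the median of the sum, $\nu = 1 - 2se^{-s} - e^{-2s}$, $\int f(2a-x)f(x)\,dx = se^{-s}$, and $\int f^2(x)\,dx = 1/2$.

```python
import math
from statistics import NormalDist


def median_of_pair_sum():
    """Median s of the sum of two independent Exp(1) variables.

    Solves 1 - (1 + s) * exp(-s) = 1/2 by bisection.
    """
    lo, hi = 0.0, 10.0
    for _ in range(100):
        mid = (lo + hi) / 2
        if 1 - (1 + mid) * math.exp(-mid) < 0.5:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


s = median_of_pair_sum()

# nu = integral of F(2a - x)^2 f(x) dx, in closed form for the exponential case.
nu = 1 - 2 * s * math.exp(-s) - math.exp(-2 * s)

# Integral of f(2a - x) f(x) dx: the Gamma(2,1) density evaluated at s.
cross_integral = s * math.exp(-s)

# Integral of f(x)^2 dx for the standard exponential density.
squared_integral = 0.5

# Limiting size from Theorem 4.2(i) when k_c is set for a nominal 5% test.
k_c = NormalDist().inv_cdf(0.975)
limiting_size = 2 * NormalDist().cdf(-k_c / math.sqrt(12 * (nu - 0.25)))
```

Under these assumptions $\nu$ is about 0.339, so the limiting size is only slightly above the nominal 5%, while the integral governing the interval length drops from 0.5 to about 0.31, which is the loss of efficiency described in Remark (ii).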